K | # of bigrams | # of trigrams | # of 4-grams | # of 5-grams | # of 6-grams |
---|---|---|---|---|---|
100 | 52 | 83 | 89 | 95 | 98 |
1000 | 219 | 530 | 757 | 895 | 951 |
10000 | 647 | 2378 | 4526 | 6666 | 8083 |
100000 | 2195 | 9878 | 22928 | 39851 | 56312 |
1000000 | 2736 | 13894 | 36172 | 67037 | 100308 |
Both the problem and the results are much similar to the previous subsection: We consider letter-N-grams at the end of words instead of the beginning.
3.8.1 Number of letter-N-grams at word beginnings